Why Fuzzy Sequential Patterns can Help Data Summarization: An Application to the INPI Trade Mark Database [FUZZ4383]

Authors

  • Céline Fiot
  • Anne Laurent
  • Maguelonne Teisseire
  • Bénédicte Laurent
Abstract

Mining fuzzy rules is one of the best ways to summarize large databases while keeping information as clear and understandable as possible for the end-user. Several approaches have been proposed to mine such fuzzy rules, in particular fuzzy association rules. However, we argue that it is important to mine rules that convey information about order. For instance, it is very useful for rules to convey the passage of time, which is what fuzzy sequential patterns do. In this paper, we thus focus on fuzzy sequential patterns. We show that mining such rules requires managing a large amount of information, and we propose algorithms that remain efficient in both memory use and computation time. Our proposal is assessed by experiments. In particular, we apply our algorithms to the INPI database, which stores almost 2 million trademarks.
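
To make the notion of an order-aware fuzzy rule concrete, here is a minimal Python sketch. It is not the authors' algorithm, and the abstract does not specify one: it assumes one simple, thresholded way of counting support, in which a sequence supports a pattern if the pattern's fuzzy items occur in order with a membership degree of at least `threshold`. The attribute name `filings`, the fuzzy sets `low` and `high`, their membership functions, and the sample data are all hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical membership functions on a 0..10 scale (assumption, not from the paper).
def low(x: float) -> float:
    return max(0.0, min(1.0, (5.0 - x) / 5.0))

def high(x: float) -> float:
    return max(0.0, min(1.0, (x - 5.0) / 5.0))

FUZZY_SETS: Dict[str, Callable[[float], float]] = {"low": low, "high": high}

Record = Dict[str, float]        # attribute -> quantity observed at one date
DataSequence = List[Record]      # records of one object, ordered by date
FuzzyItem = Tuple[str, str]      # (attribute, fuzzy set label)

def supports(seq: DataSequence, pattern: List[FuzzyItem], threshold: float = 0.5) -> bool:
    """True if the fuzzy items of `pattern` occur in `seq` in order,
    each with a membership degree of at least `threshold`."""
    pos = 0
    for attribute, label in pattern:
        while pos < len(seq):
            record = seq[pos]
            pos += 1  # the next item must match at a strictly later date
            if attribute in record and FUZZY_SETS[label](record[attribute]) >= threshold:
                break
        else:
            return False  # ran out of records before matching this item
    return True

def fuzzy_support(db: List[DataSequence], pattern: List[FuzzyItem], threshold: float = 0.5) -> float:
    """Fraction of sequences in `db` that support `pattern`."""
    if not db:
        return 0.0
    return sum(supports(seq, pattern, threshold) for seq in db) / len(db)

# Made-up data: four objects, each with a dated series of quantities.
db = [
    [{"filings": 1}, {"filings": 9}],                 # low, then high
    [{"filings": 9}, {"filings": 1}],                 # high, then low
    [{"filings": 2}, {"filings": 3}],                 # low, then low
    [{"filings": 0}, {"other": 4}, {"filings": 10}],  # low, then high
]
pattern = [("filings", "low"), ("filings", "high")]
print(fuzzy_support(db, pattern))
```

The second sequence contains the same fuzzy items as the first but in the opposite order, so an association rule would treat the two alike while the sequential pattern distinguishes them.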

Similar resources

Fuzzy Sequential Patterns Summarization with Lattice Structure

Data mining is a new, interdisciplinary field that draws on statistics, machine learning, and other methods. In recent years, fuzzy logic has also been applied to augment data mining. Applying fuzzy logic makes the mining results more understandable and interpretable, as well as useful and informative. Fuzzy rules are useful to summarize large databases. Several studies are don...

Systematic literature review of fuzzy logic based text summarization

"Information overload" is not a new term, but the massive development in technology, which enables anytime, anywhere, easy and unlimited access to, participation in, and publishing of information, has escalated its impact. Assisting users' informational searches with reduced reading and surfing time by extracting and evaluating accurate, authentic and relevant information are the primary c...

An approach to optimize Fuzzy Time-Interval Sequential Patterns using Multi-Objective Genetic Algorithm

Sequential pattern mining, which discovers frequent subsequences as patterns in a sequence database, is an important data-mining problem with broad applications. From these discovered sequential patterns, we can discover the order of the patterns; however, they cannot tell us the time intervals between successive patterns. Accordingly, Chen et al. have proposed a fuzzy time-interval (FTI) sequen...

A Proposed Data Mining Methodology and its Application to Industrial Procedures

Data mining is the process of discovering correlations, patterns, trends or relationships by searching through a large amount of data stored in repositories, corporate databases, and data warehouses. Industrial procedures with the help of engineers, managers, and other specialists, comprise a broad field and have many tools and techniques in their problem-solving arsenal. The purpose of this st...

Fuzzy Sequential Patterns for Quantitative Data Mining

The amount of generated and collected data has been rapidly increasing in the last decades; these huge data and information collections are far outpacing our abilities to analyse, summarize, and extract knowledge. This explosive growth in stored data has generated a need for new techniques that can help in transforming these large quantities of data into useful comprehensible knowledge. These t...

Publication date: 2006